Music Classification Using High-level Models

نویسندگان

  • N. Wack
  • C. Laurier
  • O. Meyers
  • R. Marxer
  • D. Bogdanov
  • J. Serrà
  • E. Gomez
  • P. Herrera
چکیده

We report here about our submissions to different music classification tasks for the MIREX 2010 evaluations. These submissions are similar to the ones sent at MIREX 2009 (see [1]), if we look at the classifiers and the main audio features. However we added high-level features (or semantic features), based on Support Vector Machine models of curated databases of different kind. We submitted two different algorithms evaluated on Mood, Genre and Artists classification. One of them is a classification algorithm using a weighted sum of Support Vector Machines. The other one is based on distances (Euclidean in a reduced space using RCA and Kullback Leibler on Mel Frequency Cepstrum Coefficients), together with K-NN. 1. FEATURE EXTRACTION This submission is coded in C++ and python. For the feature extraction part, we use an internal library of the Music Technology Group called Essentia [2]. This library contains all the features mentioned below. All frame-based statistics are aggregated using : mean and derivatives until second order, variance and derivatives until second order, minimum and maximum. We divide our features in two main categories. The ”base” features which are state-ofthe-art MIR features and the ”high-level” features. 1.1 Base features In Table 2 is the set of base features that performed the best in our preliminary experiment made on our genre, artist and mood databases. 1.2 High-level features One of the originality of our approach is the integration of high-level (or semantic) descriptors. Low level features are convenient and easy to extract. They provide satisfying classification results in many tasks. However, high-level concepts encapsulate different pattern of low-level descriptors into a single representation that can add useful information. Based on this idea, we added high level features Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. c © 2009 International Society for Music Information Retrieval. Type Features Low level barkbands spread, skewness, kurtosis, dissonance, hfc pitch and confidence, pitch salience, spectral complexity spectral crest, spectral decrease, energy, spectral flux spec spread/skewness/kurtosis, spec rolloff, strong peak ZCR, barkbands, mfcc, spectral contrast Rhythm bpm, beats loudness, onset rate Sound FX inharmonicity, odd2even, pitch centroid, tristimulus Tonal chords strength (frame), key strength(global), tuning freq Table 1. Feature set for all our classifiers. of different categories. These models are pre-trained algorithms using Support Vector Machines that are added to our bag of features. We consider them as other features with value between 0 and 1 corresponding to the SVM model prediction probability. Here we list the different models used:

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Identification and Classification of the Iranian Traditional Music Scales (Dastgāh) and Melody Models (Gusheh): Analytical and Comparative Review on Conducted Research

Background and Aim: Automatic identification and classification of the Iranian traditional music scales (Dastgāh) and melody models (Gusheh) has attracted the attention of the researchers for more than a decade. The current research aims to review conducted researches on this area and consider its different approached and obstacles. Method: The research approach is content analysis and data col...

متن کامل

Music Mood Classification Using Semantic Models

We report here about our submissions to the music mood classification tasks for the MIREX 2010 evaluations. Our classification algorithm is using a weighted sum of Support Vector Machines models. 1. FEATURE EXTRACTION This submission is coded in C++ and python. For the feature extraction part, we use an internal library of the Music Technology Group called Essentia [2]. This library contains al...

متن کامل

Music Genre Classification Using Text Categorization Method

Automatic music genre classification is one of the most challenging problems in music information retrieval and management of digital music database. In this paper, we propose a new method to classify music genres using text categorization methods. Differing from previous solutions which were mainly based on analysis on acoustic or symbolic audio signal, here we consider music as a text-like se...

متن کامل

Features for Audio Classification

Four audio feature sets are evaluated in their ability to differentiate five audio classes: popular music, classical music, speech, noise and crowd noise. The feature sets include low-level signal properties, mel-frequency spectral coefficients, and two new sets based on perceptual models of hearing. The temporal behavior of the features is analyzed and parameterized and these parameters are in...

متن کامل

Features for audio and music classification

Four audio feature sets are evaluated in their ability to classify five general audio classes and seven popular music genres. The feature sets include low-level signal properties, mel-frequency spectral coefficients, and two new sets based on perceptual models of hearing. The temporal behavior of the features is analyzed and parameterized and these parameters are included as additional features...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010